UP | HOME

Date: [2020-11-24 Tue]

Eliezer Yudkowsky

Table of Contents

Eliezer Shlomo Yudkowsky (born September 11, 1979) is an American Artificial intelligence (AI) researcher and writer best known for popularizing the idea of friendly artificial intelligence.

An autodidact, Yudkowsky did not attend high school or college.

Noting the difficulty of formally specifying general-purpose goals by hand, Russell and Norvig cite Yudkowsky's proposal that autonomous and adaptive systems be designed to learn correct behavior over time:

Yudkowsky (2008) goes into more detail about how to design a Friendly AI. He asserts that friendliness (a desire not to harm humans) should be designed in from the start, but that the designers should recognize both that their own designs may be flawed, and that the robot will learn and evolve over time. Thus the challenge is one of mechanism design—to design a mechanism for evolving AI under a system of checks and balances, and to give the systems utility functions that will remain friendly in the face of such changes.

In response to the instrumental convergence concern, where autonomous decision-making systems with poorly designed goals would have default incentives to mistreat humans, Yudkowsky and other MIRI researchers have recommended that work be done to specify software agents that converge on safe default behaviors even when their goals are misspecified.

In the intelligence explosion scenario hypothesized by I. J. Good, recursively self-improving AI systems quickly transition from subhuman general intelligence to superintelligent.

1. MIRI - Machine Intelligence Research Institute

Over 300 blogposts by Yudkowsky on philosophy and science (originally written on LessWrong and Overcoming Bias) were released as an ebook entitled Rationality: From AI to Zombies by the Machine Intelligence Research Institute (MIRI) in 2015. MIRI has also published Inadequate Equilibria, Yudkowsky's 2017 ebook on the subject of societal inefficiencies.


References


Backlinks


You can send your feedback, queries here